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ABSTRACT 

Weak gravitational lensing has become an important method to determine the 
masses of galaxy clusters. The intrinsic shapes of the galaxies are a dominant source 
of uncertainty, but there are other limitations to the precision that can be achieved. 
In this paper we revisit a typically ignored source of uncertainty: structure along the 
line-of sight. Using results from the Millennium Simulation we confirm the validity of 
analytical calculations that have shown that such random projections are particularly 
important for studies of the cluster density profile. In general the contribution of large- 
scale structure to the total error budget is comparable to the statistical errors. We 
find that the precision of the mass measurement can be improved only slightly by 
modelling the large-scale structure using readily available data. 
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1 INTRODUCTION 

The number density of galaxy clusters as a function of mass 
and redshift can be used to constrain cosmolo gical param- 



eters and probe the growth of structure (e.g.. iHenrv 



Borgani et al.ll200ll;lGladders et al.ll2007l ; iHenrv et al 1120091 
Mantz et al.1 |20ld : Ivikhlinin et alj|2009h . This approach is 



2000; 



conceptually straightforward, but the actual implementation 
of this method is more difficult. This is because the clusters 
are identified based on their baryonic properties (e.g., galaxy 
counts, SZ decrement, X-ray luminosity or temperature), 
which need to be related to the underlying dark matter dis- 
tribution. The relation between observed cluster properties 
and the mass depends itself on the relative importance of 
the various physical processes that play a role in galaxy and 
cluster formation. Galaxy clusters provide an excellent lab- 
oratory to study these, because multi-wavelength observa- 
tions provide us with a complete census of the various com- 
ponents: we can actually observe the stars, gas and dark 
matter content. 

The use of clusters to constrain cosmology and to im- 
prove our understanding of cluster physics are closely inter- 
related: feedback processes and differences in formation his- 
tory lead to variations in the observable properties at a given 
mass. The resulting intrinsic scatter changes the selection 
function and thus leads to biased constraints on cosmologi- 
cal parameters if left unaccounted for. Correctly interpreting 



the scatter in the scaling relations requires a good under- 
standing of the various sources of uncertainty, which can be 
either physical or statistical. 

A variety of methods can be used to determine the 
mass of a cluster. Most of these are based on dynamics and 
assume the cluster is relaxed. In this case, the mass can 
obtained from the velocity dispersion of the cluster galax- 
ies. Measurements of the gas pressure, obtained from ob- 
servations of the hot X-ray emitting intracluster medium 
(ICM), provide another powerful tracer of the dark matter 
content. This approach, which assumes hydrostatic equilib- 
rium, has been used extensively, thanks to the high quality 
observations obtained using powerful X-ray telescopes such 
as the Chandra X-ray Observatory and XMM-Newton (e.g., 
IVikhlinin et alj|2009l : [Mantz et alj|2010l ). The interpretation 
of such measurements is often complicated by the presence 
of substructures and the fact that most clusters are not re- 
laxed. A major concern is the assumption of hydrostatic 
equilibrium, because numerical simulations have shown that 
active galactic nuclei, turbulence, and bulk motions of the 
gas, as well as variations in the merging history can lead 
to systematic u nderestimates of masses based on X-ray ob- 
servations (e.g., lEvra rd 1990; Dolag et al. 2005; Rasi a et al.l 
l2006l;lNagai et al.ll2007l ). Although recent simulations incor- 
porate a wide range of physical processes it is not clear to 
what extent the simulations provide a realistic estimate of 
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the systematic error in the mass. It is therefore critical to 
compare dynamical techniques to methods that do not suffer 
from these problems. 

This is possible thanks to a phenomenon called weak 
gravitational lensing: the cluster mass distribution perturbs 
the paths of photons emitted by distant galaxies. As a re- 
sult the images of these background galaxies appear slightly 
distorted (or sheared). Unlike dynamical methods, gravita- 
tional lensing does not require one to make assumptions re- 
garding the dynamical state of the cluster. The amplitude of 
this distortion provides us with a direct measurement of the 
gravitational tidal field, which in turn provides us with an 
estimate for the projected cluster mass. The recent improve- 
ments in sample size and precision allowed iMahdavi et al.l 
(2008) to compare weak lensing and hyd r ostat ic masses for 
a sample of 18 clusters. IMahdavi et al.l (|2008T ) found that 
at large radii the X-ray results underestimate the mass, 
in agreement with the findings from numerical simulations. 
These findings demonstrate the usefulness of weak lensing 
for multi-wavelength studies of galaxy clusters. 



2 COSMIC NOISE 

The observed lensing signal is the combination of the shear 
induced by the cluster mass distribution and that of other 
structures along the line-of-sight. The expectation value of 
the latter vanishes, but it does introduce additional vari- 
ance in the cluster mass estimate. The effect of this cosmic 
noise on w eak lensing cluster st udies can be quantified an- 
alytically (Hoekstra 2001, 120031 ') or using numerical simu- 
lations ( White et al.l 20021 ) . Not surprisingly, these studies 
have shown that the cosmic noise is most important when 
the cluster signal itself becomes small: i.e., when data at 
large cluster-centric radii are used, or when clusters at low 
redshifts are studied. Cosmic noise, however, is also a con- 
siderable source of uncertainty for clusters at intermediate 
redshifts. 

Even for a massive cluster, the induced change in the 
shape of a source galaxy's image is typically small com- 
pared to its intrinsic ellipticity. It is therefore convenient 
to azimuthally average the tangential shear 7t and study 
its variation as a function of radius. It can be related to the 
surface density through 



Weak lensing masses ar e now routinely measured for 
large samples of clusters (e.g. Hoekstra 2007; lBardeau et al.l 
120071 ; lOkabe et all l201Ch . Tests on simulated data have 
shown that the best analysis methods can reduce sys- 
tematic errors in the shear measurement s to ~ 1 — 2% 
jHevmans et all 120061 ; iMassev et all 120071 '). Much of the 
recent progress has been driven by the desire to mea- 
sure the weak lensing signal caused by intervening large- 
scal e structure, a.k.a. cos mic shear (for a recent review 
see iHoekstra fc Jain] I2008T ). The cosmic shear signal has 
now been detected at high sign i ficance in a number of sur- 
veys (e.g.. IHoekstra et al]|2002l; I Van Waerbeke et alJ 2005 ; 



Hoekstra et al.ll2006l; iBeniamin et al.l 120071: iFu et alJhoQg 



Schrabback et all l2010t l and is one of the most promising 



tools to study dark energy. 

This cosmological signal, however, limits the accuracy 
with which cluster masses can be determined: the observed 
lensing signal is a combination o f the cluster sign al and cos- 
mic shear. As first discussed in iHoekstral (2001) the large- 
scale structure along the line-of-sight is an additional source 
of noise, but does not bias the measurement of the mass. As 
shown by IHoekstral (|2003f ) this 'cosmic noise' is particularly 
relevant when s tudying cluster mass density profiles (also see 
lDodelsorJ|2003 ). Although the effects of uncorrelated struc- 
tures along the line-of-sight are often acknowledged, their 
contribution to the formal error budget has typically been 
ignored. This is somewhat surprising, given that there is lit- 
tle doubt that cosmic shear has been measured. 

In th i s paper we revisit the predictions presented in 
IHoekstral (|200lL 120031 ') using ray-tracing results from the 
Mille nnium Simulation (jSpringel et al.l 120051 : iHilbert et al.l 
2009), demonstrating in §3 once more that cosmic noise 
should not be ignored in weak lensing studies. We also quan- 
tify for the first time the noise introduced by the finite sam- 
pling of the source redshift distribution. In §4 we examine 
whether cosmic noise can be suppressed using readily avail- 
able data. 



(7T>(r) = 



E(< r)-E(r) 



= «(< r) — ft(r-), 



(1) 



where E(< r) is the mean surface density within an aperture 
of radius r, and E(r) is the mean surface density on a cir- 
cle of radius r. The convergence k, or dimensionless surface 
density, is the ratio of the surface density and the critical 
surface density E cr it, which is given by 



Ecrit — 



4ttG DiD h 



(2) 



where Di is the angular diameter distance to the lens. D s 
and Dis are the angular diameter distances from the ob- 
server to the source and from the lens to the source, respec- 
tively. 

The variance in the azimuthally averaged tangential 
shear in an annulus ranging from r± to r 2 caused by large- 
scale structure along the line-of-sight is given by (jHoekstral 
l2003h : 

p OO 

at ss (ri,r 2 ) = 2n dl lP K (l)g 2 (l, ri ,r 2 ), (3) 
Jo 

where the convergence power spectrum P K {1) is given by: 



P K (l) = 



4c 4 



d w 



W(w) 
a(w) 



ft 



(4) 



Here w is the radial coordinate, a(w) the cosmic scale fac- 
tor, and /x(w) the comoving angular diameter distance. 
Ps(l;w) is the matter power spectrum. We consider rela- 
tively small scales, and therefore need to account for the non- 
linea r evolution fe.g. | Jain fc Seliakl 1 19971 : [Schneide r et al.l 
1 19981 : IHilbert et alj|2009h . W(w) is the average ratio of an- 
gular diameter distances Di s /D a for a redshift distribution 
of sources p w (w): 



W(w)= dwp w (w) — v ' ■ 



(5) 
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The function g{l,r%,r2) in Eqn. © is a filter of the 
convergence power spectrum and is specified by our choice to 
co nsider the azimu thally averaged tangential shear. We refer 
to lHoekstral (|2003h for a more detailed discussion, including 
the expression for g{l,r\,r2)- In this paper we measure the 
azimuthally averaged tangential shear as a function of radius 
r = (ri+f2) /2, in bins that are r-z — r\ = 15 arcseconds wide. 

2.1 Source redshift sampling 

In this paper we identify another source of error, which is 
important for high redshift clusters and at small radii. It 
arises because the amplitude of the lensing signal depends 
on the redshift of the sources. At large distances from the 
cluster, the signal is obtained by averaging over a relatively 
large number of galaxies, thus sampling the average redshift 
distribution fairly well. However, at small radii the num- 
ber of sources is much smaller leading to a large variance 
in the actual redshift distribution n(z). This problem can 
be dealt with using photometric redshifts for the sources, 
but the required increase in observing time may make this 
difficult to achieve in practice. This sampling variance de- 
pends on the width of the redshift distribution and can be 
estimated f rom observations of blank fields (e.g. Jllbert et all 
2006, 2009). This estimate, however, does not account for the 
clustering of the source galaxies, which increases the scat- 
ter further. To include the effects of source clustering one 
can place corresponding apertures in observed fields with 
redshifts and measure the scatter. This approach, however, 
does require a rather large survey area. To quantify how the 
combination of distant (i.e., uncorrelated) large-scale struc- 
ture and variations in the source redshift distribution affect 
weak lensing mass determinations we need a realistic distri- 
bution of source galaxies, which themselves are part of the 
large-scale structure. This requires mock data sets based on 
cosmological numerical simulations of a large area. 




12 3 



Z 



Figure 1. Redshift distribution n(z) of the simulated source 
galaxies, with apparent magnitudes r < 25 (solid line). For com- 
parison redshift histograms for the COSMOS survey (red dotted 
histogram; Ilbert et al., 2009) and the CFHTLS (blue dashed his- 
togram; Ilbert et al., 2005) are shown (same selection in apparent 
magnitude). The difference may be due to limitations of the sim- 
ulation, but may also reflect incompleteness at high redshifts in 
the case of the photometric redshift catalogs. Nonetheless, the 
redshift distribution derived from the simulations is adequate for 
our study. 



2.2 Numerical simulations 

We use results f rom the Millennium Simulation 
jSpringel et all 120051 ). which is a large JV-body simu- 
lation following the evolution of 2160 3 dark matter particles 
in a periodic box with a side length of 500 h~ x Mpc, using 
a flat ACDM cosmolog^Q. The lensing signal is obtained 
from a carefu l ray-t racing analysis presented in detail in 
iHilbert et all (2009). The simulation is carried out by 
dividing the periodically continued matter distribution of 
the Millennium Simulation into 36 slices out to z = 3, each 
with a thickness of « 100 h~ Y Mpc. These are subsequently 
projected onto lens planes, and a set of light rays is 
propagated through the array of planes. Using a set of 
recursion relations, the ray positions on each plane and the 
Jacobian matrices for the light paths from the observer to 
each plane are obtained. Different realizations are obtained 
by choosing different observer positions, in our case yielding 

1 The values for the cosmological parameters that were adopted 
are: a matter density of Q m = 0.25, a cosmological constant with 
SIa = 0.75, a Hubble constant of Ho = 73km/s/Mpc, a spectral 
index n = 1 and a normalisation erg = 0.9 for the primordial 
power spectrum of density fluctuations. 



512 patches of one square degree each. The periodic 
repetition of structures along the line-of-sight, which is 
caused by the finite volume of the Millennium Simulation, is 
minimized by choosing a line-of-sight direction that encloses 
carefully chosen angles with the faces of the simulation 
box. The advantage of this approach is that the matter 
distribution remains continuous across slice boundaries, so 
that correlations on scales larger than the thickness of the 
redshift slices are maintained. 

Information on the properties of galaxies is obtained 
from the semi-ana l ytic m odels of galaxy formation by 
|Pe Lucia fc Blaizotj i|2007l ). Combined with the ray-tracing 
results, this allows us to obtain realistic lensed positions and 
magnitudes for each galaxy, together with shear and conver- 
gence at the galaxies' locations. For our galaxy catalogues, 
we impose a magnitude cut of tsdss < 25. The average 
redshift distribution of these source galaxies is presented in 
Figure [T] The results agree fairly well with phot ometric red- 
shift d istributions from the COSMOS survey ijllbert et alJ 
(2009 ); red dotted hist ogram) and the CFHT Legacy Sur- 
vev |llbert et al.l |2006l ) ; blue dashed histogram) . The actual 
redshift distributions appear to peak at somewhat lower red- 
shift, but this difference is not important for the study pre- 
sented here. 
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2.3 Cluster Signal 

The numerical simulations provide a realistic lensing sig- 
nal that would be observed in a random patch of sky. This 
signal is also present when a cluster of galaxies is studied: 
the observed lensing signal is the combination of that of the 
cluster and the distant large-scale structure (distant in the 
sense that it does not know about the cluster). To simulate 
this, we can simply add the cluster signal to that from the 
simulations. We assume t hat the density profil e of a cluster 
is described by the NFW l|Navarro et alj|l997f ) profile 



p(r) = 



M v i r 



4tt/(c) r(r + r s ) 2 



(6) 



where M v i r is the virial mass, the mass enclosed within the 
radius r v i r . The virial radius is related to the scale radius 
r s through the concentration c = r v i r /r a and the function 
/(c) = ln(l + c) — e/(l + c). By definition, the virial mass 
and radius are related through 



Mvir 



47T . 

~3~ 



~{z)ph g (z)r v 



where pbg = 3HoQ, m (l + z) 3 /(8ttG) is the mean density at 
the cluster redshift and the virial overdens ity A v j r ~ (187r 2 + 
82£ - 39£ 2 )/n m (z), with £ = Q, m (z) - 1 (|Brvan fc Normanl 
1998). For the ACDM cosmology considered here, A v i r (0) = 
337. Expressions for the surface density and tangentia l shear 
of t he NFW profile have been d erived bv lBartelmannl (1 19961 ) 
and IWright fe Brainerdl l|2000l ) and we refer the interested 
reader to these papers for the relevant equations. 

In simulations of collisionless cold dark matter the NFW 
density profile provides a good description of the radial 
mass distribution for halo s with a wide range in mass (e.g., 
iNavarro et allll995l . fl997l ). The density profile is described 
by specifying M v ir and concentration c (or equivalently r 3 ). 
Numerical simulations, however, indicate that the average 
concentration depends on the halo ma s s and the redshif t 
(|Navarro et al.ll 19951 ; iBullock et al]|200ll ; iDuffv et al.ll2008h . 
To account for this correlation we use the relat ion between 
the vi rial mass M v i r and concentration c from IDuffv et al.l 
(2008) who studied numerical simulations using the best 
fit pa rameters of the WMAP5 cosmologjQ (Ko matsu et al.l 
I2009D . The best fit c(M vir ) is given by: 



c = 7.85 



Mvir 



2 x 10 1 



(1 + *)- 



(8) 



Simulations show considerable variation in the density 
profiles, resulting in a lognormal distribution o f c with a 
scatter cri oge ~ .1 for halos of a given mass ( e.g.,ljind[2OO0l : 
iNeto et al.fl2007l ). Furthermore. iNeto etail (120071 ) showed 
that the concentration distributions are different for 're- 
laxed' and 'unrelaxed' halos. Although these physical varia- 
tions are an additional source of uncertainty when attempt- 
ing to constrain the mass-concentration relation observa- 
tionally, they are not relevant for our study of cosmic noise. 

2 This is a different cosmology from the one used to run the 
Millennium Simulation, but we note that the actual choice of 
mass-concentration relation is not important for the main results 
presented in this paper. 
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Figure 2. The dispersion o"lss = (7y) 1 ' 2 introduced by distant 
large scale structure. The solid black line shows the dispersion 
measured from the 512 realisations from the Millennium Simula- 
tion. At small radii the small number of sources introduces addi- 
tional scatter (indicated by the dashed blue curve). The smooth 
red line corresponds to the analytical prediction from Hoekstra 
(2003). The prediction does not account for noise arising from the 
finite number of sources and should be compared to the dotted 
black line (which is corrected for this effect). The prediction is 
about 15% lower, which is due to the adopted non-linear power 
spectrum (see text and Hilbert et al. 2009). 



3 RESULTS 

For each of the 512 realisations from the Millennium Simula- 
tion we measure the azimuthally averaged tangential shear 
as a function of radius from the centre of the image. The 
solid black line in Figure [5] shows the resulting dispersion 
olss = ilr) 1 ^ 2 as deter mined from the 512 r ealisa tions. The 
prediction based on the iPeacock fc Doddsl l|l996h prescrip- 
tion for the non-linear evolution of the power spectrum is 
indicated by the red line. On large scales (> 5') the predic- 
tion is about 15% lower, which we attribute to the adopted 
non-linear power s pectrum. This differe nce is consistent with 
the conclusions of Irlilbert et a l. (2009) who compared vari- 
ous prescriptions for the non-linear power spectrum. 

Close pairs of galaxies are sheared by similar amounts 
if all sources are at the same redshift. In this case, the dis- 
persion in the tangential shear in a given radial bin would 
be small (for a given realisation). This is not true for ac- 
tual observations, because the source redshift distribution is 
broad (see Figure [TJ. At large radii, where the signal is aver- 
aged over many galaxies, the source redshift distribution is 
expected to be close to the average. At small radii this is not 
a good representation, because the small number of sources 
samples the average distribution only sparsely. This leads to 
additional noise if photometric redshifts for the sources are 
not available. Unlike the distant large-scale structure, this 
effect is only relevant at small radii. 
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Figure 3. The dispersion (7^)2 for a cluster with M v j r = 
10 15 M Q with redshift 2 = 0.2,0.4 and 0.6 (solid-black, dashed- 
red and dotted blue lines, respectively). At large radii the cosmic 
shear contribution, which is independent of cluster redshift dom- 
inates the dispersion. On small scales, the dispersion in source 
redshifts increases with lens redshift. 



On small scales we can assume that the dispersion in 
the tangential shear for a single realisation is predominantly 
caused by the spread in source redshifts. The resulting mean 
dispersion as a function of radius is indicated by the dashed 
blue curve in Figure [2] This is the contribution to (tlss 
caused by the finite sampling of the source redshift distri- 
bution. If w e remove this so urce of scatter, the agreement 
between the lHoekstral (|2003h prediction and the Millennium 
Simulation is excellent (as indicated by the dotted line), 
keeping in mind the differ ence in amplitude o f the n on-linear 
power spectrum based on lPeacock fc Doddsl (|l996l ). Hence, 
variations in the actual source redshift distribution lead to 
an increase in the observed variance at radii < 4'. Most of 
the noise caused by the lack of photometric redshifts arises 
from the fact the redshift distribution is broad, but we ex- 
pect that the scatter is boosted by the fact that sources are 
in fact clustered. Comparison with the simulations confirms 
this, but the increase in scatter is modest: the increase is 
only ~ 20% compared to the estimate based on the n(z) 
alone. 

The lack of knowledge of the actual source redshift dis- 
tribution contributes to the uncertainty in the cluster mass 
because it leads to scatter in the ratio /3 = Di s /D s in the 
expression for the critical surface density. For a low redshift 
cluster most sources are at much higher redshifts and (3 ~ 1. 
Consequently the variation in /3 is small. As the redshift of 
the lens is closer to the mean source redshift, the variation in 
Di a /D B increases. This is demonstrated in Figure [3] which 
shows the dispersion for a cluster with mass M vir = 1O 15 M 
at various redshifts. At large radii the variation in the red- 
shift distribution is negligible and the dispersion converges 



Figure 4. Histogram of the best fit virial masses M v ; r for the 
512 realisations for lenses at z = 0.05,0.1,0.2 and 0.4, when we 
adopt the mass-concentration relation from Duffy et al. (2008). 
The red histograms show the distribution of results when only 
distant large scale structure is considered. For low redshifts the 
distribution of masses is skewed, but the average remains unbiased 
(indicated by the vertical dashed line). The black histograms show 
the results when the variation in the source redshift distribution is 
included. Note that these results do not include any shape noise. 



to the cosmic shear signal. At small radii, the scatter caused 
by the variation in the source redshift distribution increases 
rapidly with cluster redshift. Note that deeper observations 
will improve the sampling of the redshift distribution be- 
cause of the larger number of sources. However, at the same 
time the average source redshift will increase, result ing in a 
larger cosmic noise contribution (see lHoekstrall200ll ). 

3.1 Mass estimates 

In this section we study the combined effect of cosmic noise 
and the finite sampling of the source redshift distribution 
on the weak lensing mass estimate of a cluster with a virial 
mass M v i r = 10 15 M Q . We assume it follows an NFW pro- 
file with the concentration given by Eqn. ((8} and add the 
corresponding lensing signal to the shear inferred from the 
ray-tracing analysis, yielding 512 realisations of the cluster 
lensing signal. We fit an NFW model to the resulting lens- 
ing signal out to an outer radius R ou t- The innermost point 
is Y.'5, but the results do not depend much on this choice: 
the small number of sources at these radii means they have 
a low statistical weight in the fit. We consider only shape 
noise as a source of uncertainty and determine the best fit 
masses from a standard least-squares fit. 

We first consider the case where M v i r is the only pa- 
rameter that is fit, because the concentration is specified 
through Eqn. [8] Figure [4] shows the resulting distribution of 
masses for clusters at various redshifts if we fit the NFW 
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Table 1. Dispersion in M v ; r and c 





adopting c(M) 


M vir 


&; c free parameters 




Rout— 10' 


Rout— 25' 


Rout 


=10' 


Rout 


=25' 


■^lcns 




cm 








a c 


LSS only 


0.05 


0.25 


0.22 


0.69 


0.82 


0.43 


0.81 


0.1 


0.15 


0.17 


0.35 


0.53 


0.25 


0.63 


0.2 


0.12 


0.15 


0.21 


0.44 


0.21 


0.59 


0.4 


0.13 


0.18 


0.21 


0.48 


0.23 


0.69 


0.(5 


0.18 


0.24 


0.28 


0.61 


0.29 


0.88 


variation in n(z) only 


0.05 


0.01 


0.01 


0.01 


0.04 


0.01 


0.03 


0.1 


0.02 


0.01 


0.02 


0.08 


0.01 


0.06 


0.2 


0.03 


0.02 


0.03 


0.25 


0.02 


0.23 


0.4 


0.07 


0.06 


0.05 


0.33 


0.05 


0.31 


0.6 


0.10 


0.08 


0.08 


0.45 


0.07 


0.41 




combination of LSS and variation in 


n(z) 




0.05 


0.25 


0.22 


0.73 


0.83 


0.44 


0.81 


0.1 


0.14 


0.16 


0.40 


0.56 


0.24 


0.65 


0.2 


0.11 


0.14 


0.23 


0.54 


0.21 


0.67 


0.4 


0.12 


0.15 


0.22 


0.65 


0.21 


0.82 


0.6 


0.16 


0.22 


0.28 


0.85 


0.26 


1.08 


statistical 


0.05 


0.17 


0.10 


0.93 


1.27 


0.16 


0.66 


0.1 


0.12 


0.08 


0.25 


0.81 


0.10 


0.54 


0.2 


0.10 


0.07 


0.15 


0.65 


0.09 


0.52 


0.4 


0.12 


0.10 


0.15 


0.73 


0.11 


0.63 


0.6 


0.17 


0.14 


0.19 


1.00 


0.15 


0.89 



Dispersions in the values for the best-fit virial mass M v ; r (in units 
of 1O 15 M0) and concentration c as a function of lens redshift and 
the maximum radius that is used in the fit. We list results for the 
effects of distant large-scale structure and variation in the source 
redshift distribution separately and combined. These results do 
not include the statistical error due to the intrinsic ellipticities of 
the source galaxies. The statistical errors (but now without LSS 
contributions) are given in the fourth set of results. 

model out to .Rout = 10'. For low redshifts the distribution 
is somewhat skewed, but the mean value is unbiased (as in- 
dicated by the vertical dashed lines). Table [T] lists the values 
for the scatter cfm caused by the combined effects of cos- 
mic noise and source redshift variation. Comparison with 
the statistical errors (computed assuming a total intrinsic 
source ellipticity of 0.3) shows that the cosmic noise contri- 
bution is quite comparable at all redshifts. Cosmic noise is 
minimal at intermediate redshifts (0.2 < z < 0.4). 

The increase for z > 0.4 is caused by the fact that the 
angular extent of the cluster decreases, whereas the aperture 
.Rout is kept fixed. The dashed lines in Figure [5] show how 
the cosmic noise depends on -R ou t- For reference, Table [1] 
also lists the values for om for Rout = 25'. For z — 0.05 
(black line) the cosmic noise decreases with aperture size, 
but at higher redshifts it increases. However, the statistical 
uncertainty decreases with increasing Rout, as indicated by 
the dotted lines. The solid lines indicate the net result: for 
z = 0.05 there is a net gain, but for a cluster with z > 0.2 
(blue line) there is no benefit extending the fit beyond 10'. 




o.i " 



500 1000 1500 
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Figure 5. The solid lines show the total uncertainty in the best- 
fit virial mass (forM vir = 1O 15 M ) as a function of the maximum 
radius used to fit the NFW for a lens redshift of z = 0.05 (black), 
z = 0.1 (red) and z = 0.2 (blue). The contribution from the shape 
noise (i.e., statistical error) is indicated by the dotted curves, 
whereas the dashed lines show the LSS contribution. For z = 
0.05 there is a clear benefit from wide-field imaging data, but at 
z = 0.2 the uncertainty is flat for R ou t > 10'. 

We also compare the relative contributions of the dis- 
tant large-scale structure and the variation in n(z). The lat- 
ter is computed by using the simulated redshift distribution, 
but without adding the cosmic noise to the cluster signal. To 
compute the former, we add the cluster shear computed for 
the average source redshift to the ray-tracing results. Inter- 
estingly, the combined effect of LSS and variation in n(z) is 
to slightly reduce the scatter in the recovered masses, com- 
pared to the LSS-only case. This can be easily understood: 
a structure at lower redshift will increase the lensing signal, 
but will also increase the number of sources at these red- 
shifts. As the latter are lensed less than the average source, 
they partly offset the increase in lensing signal. Note that 
the combined effect does not bias the cluster mass estimates. 

The lensing signal of high redshift clusters can be 
boosted by removing foreground galaxies using photometric 
redshift information and optimally weighing the remaining 
sources based on their Di s /D s . However, the cosmic noise 
increases also rapidly with source redshift: ctlss oc z 1A for 
(z < 1), which might limit the expected improvement in 
precision. The redshift distribution used here drops quickly 
beyond z ~ 1 and we find that for clusters with z > 0.4 
the photometric redshift information does improve the mass 
measurements. 



3.2 Joint constraints on mass and concentration 

So far we examined the effect of cosmic noise when one as- 
sumes a mass-concentration relation which is based on nu- 
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merical simulations. Instead, many studies fit the lensing 
signal with both M v i r and c as free parameters. This allows 
one to directly constrain the conce ntration and therefore test 
the numerical simulations (e.g., Clowe Sz Schneider 20021; 
Hoekstra et al.ll2002l ; lMandelbaum et al.ll2008l ; lOkabe et all 
20101 ). Cosmic noise, however, significantly increases the for - 
mal uncertainties in such measurements l|Hoekstral [2003). 
Figure[6]shows the distribution of best fit values for M v i r and 
concentration c for the 512 realisations when fitting both 
parameters simultaneously. The contours indicate the sta- 
tistical uncertainties in the parameters, whereas the points 
show the spread due to cosmic noise and variation in the 
source redshift distribution. Table [1] lists the scatter in the 
parameters. It is clear that cosmic noise has a large impact 
on the ability to constrain the concentrations. In particular, 
note the outliers with high inferred masses and low concen- 
trations. 

We also examined whether cosmic noise biases the slope 
of the mass-concentration relation that i s inferred from stud - 
ies of samples of clusters. For instance, lOkabe et al.l (|2010l ) 
obtain a power-law slope of 0.40±0.19, which is steeper than 
is seen in numerical simulations. They examined the corre- 
lation in parameters due to the shape noise for simulated 
rofiles with masses M vil = 0.2- 1.5 x 10 15 Mm. lOkabe et all 



2010) find a bias of 0.06, much smaller than the observed 
value. We performed a similar test and find that cosmic noise 
also biases the inferred slope, but by a similar amount. The 
combined effect of shape and cosmic noise is a bias of only 
0.08 in the slope, because the correlation between M v i r and 
c is similar for both sources of error. We note that the in- 
ferred slope is steeper if smaller range in mass is considered: 
we find a bias of 0.17 for a range M vir = 1 - 1.5 x 1O 15 M . 
Hence, the most important consequence of including cosmic 
noise is to reduce the significance of the measurement of the 
slope of the mass-concentration relation. 



4 REDUCING COSMIC NOISE 

The results presented in the previous section indicate that 
weak lensing studies should include cosmic noise in their 
error budget. An interesting question is whether one can re- 
duce, or even remove, the eff ects of cosmic no ise. A statistical 
approach was discussed by iDodelsonl {2004) who proposed 
a minimum variance estimator to account for cosmic noise 
in mass reconstructions. A concern, however, is that sub- 
structures associated with the cluster might be suppressed 
as well. 



4.1 Accounting for additional clusters 

In this section we will explore whether the observations 
themselves can be used to reduce the cosmic noise. Although 
one can imagine many different ways to predict the cosmic 
noise signal, we will consider a relatively simple method. It 
requires only a minimum of colour information and is there- 
fore readily available: most studies include (some) color in- 
formation to identify cluster members. 

Massive collapsed structures, such as galaxy clusters 
and groups of galaxies contribute a large fraction of the 
power on the physical scales relevant for cosmic noise. For- 
tunately they can be identified in multi-colour data, similar 
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Figure 6. Distribution of best-fit M v ; r and c when both param- 
eters are free to vary. The points indicate the spread in results 
when distant large-scale structure and source redshift variation 
are included. The contours indicate the regions that enclose 68% 
and 90% of the fits when only statistical (shape) noise is con- 
sidered. The red line shows the mass-concentration relation from 
Duffy et al. (2008). 



to what is done in optical cluster surveys. The most massive 
syste ms can readily be locat ed using a red-sequence method 
(e.g., iGladders fc Yee3l2000l) . Photometric redshifts, involv- 
ing more colours, can be used to find lower mass halos. For 
instance, iMilkeraitis et all l|2010l ) used the Millennium Sim- 
ulation to examine how well one can identify clusters using 
five optical filters. They find that clusters with masses larger 
than ~ 5 x 10 13 can be detected with fairly high complete- 
ness (~ 80%) and low false detection rate (~ 20%). 

After having identified the clusters one needs to esti- 
mate their contribution to the lensing signal. Here we take 
an optimistic approach and assume we can find all halos 
down to a virial mass limit M\ lm . In practice such a clear 
mass limit may be more difficult to achieve. We fit these 
halos simultaneously with the cluster of interest (where we 
ignore shape noise and assume the halos follow an NFW 
profile with our adopted mass-concentration relation). For 
a limiting mass My lm = 5 x 10 13 Mq on average 5.4 halos 
are fit in addition to the input cluster (with actual numbers 
ranging from to 12). We find that this procedure does not 
bias the recovered cluster mass. 

Figure [7] shows the resulting scatter in the best fit virial 
mass as a function of the mass limit of the halos included 
in the fit. The solid lines show the results when the NFW 
model is fit out to 10', whereas the dashed lines show the 
results for Rout = 25'. For reference, the left panel indicates 
the corresponding statistical uncertainties in the virial mass 
due to the shape noise. 

The results suggest our simple approach is indeed able 
to reduce the effect of cosmic noise. Figure [7] shows that this 
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has proven to be useful to study the clustering of galaxies 
and interpret the galaxy-mass cross-correlation function. On 
small scales the Poisson term dominates. It is given by 



Avf(v)^±\y{k,M{v))\ 2 . 



(9) 



where p is the mean matter density and y(k, M) is the ratio 
of the Fourier transform of the halo profile p(k) and the 
halo mass M(u). The peak height v of such an overdensity 
is given by 



S c (z) 



a[M) 



(10) 



where & c is the value of a spherical overdensity at which 
it collapses at a redshift z. a(M) is the rms fluctuation in 
spheres that contain mass M at an initial time, extrapolated 
to 2 using linear theory. The function f(v) is related to the 
halo mass function dn /AM through 



Figure 7. The scatter in the best-fit virial mass (for M v ; r = 
1O 15 M0) as a function of Mn m , the minimum mass of halos that 
are fit simultaneously with the cluster of interest. The improve- 
ment is largest for a cluster at z = 0.05 (black line). The benefits 
are smaller for z = 0.1 (red line) and z = 0.2 (blue line). The 
solid lines show the results when the NFW model is fit out to 
10', whereas the dashed lines show the results for Bout = 25'. For 
reference, the left panel shows the statistical uncertainty in the 
virial mass due to the intrinsic shapes of the source galaxies. 



is most relevant for clusters at very low redshifts. However, 
even with a (low) mass limit of M\x m — 5 x 1O 13 M0, the 
cosmic noise remains a dominant source of uncertainty. We 
have examined several other approaches, such as using the 
luminosities of galaxies to predict the lensing signal, and 
found that none is able to significantly improve upon the 
relatively simple approach outlined above. We now will at- 
tempt to understand why this is. 



4.2 Limitations 

To compute the cosmic noise signal in §3 we need the non- 
linear power spectrum of density fluctuations to account 
for the fact that collapsed halos increase the power on 
small scales. In general it is computed using (fitting func- 
tions to) numerica l simu lations, such as the prescription of 
iPeacock fc Doddsl (1 19961 ') that we used here. The observa- 
tion that dark matter halos are well described by NF W pro- 
files a llows for an analytic approach as suggested bv lSeliakl 
(2000). In this model the abundance of halos is given by 
the halo mass function and their clustering is described by 
a mass dependent bias relation. The dark matter profiles 
themselves are described by spherical NFW profiles that 
are functions of the mass only (i.e., they follow a mass- 
concentration relation). The resulting power spectrum is the 
sum of the contribution from a Poisson term that corre- 
sponds to individual halos P F (k) and a term arising from 
the clustering of halos P hh (k) themselves. This halo-model 



4^dM = 4-J{y)Av. (11) 
AM M JK ' y ' 

We use the expressions from Sheth et al. I teOOll ) for f{y) 
and the M(c) relation from iDuffv et al.l ( 20081 ) to compute 
the Poisson term. The halo-halo term is important on large 
scales and is computed by integrating over the mass function 
with the halo bias b(v) and Fourier transform of the density 
profile 



P hh (fc) = Pnn(fc) / dvf{u)b{u)y{k,M(u)) 



(12) 



where Pu n (k) is the linear power spectrum. 

The black line in Figure [8] shows the cosmic noise (LSS- 
only) measured from the simulations. The dashed black line 
is the halo model prediction, where the Poisson term has 
been multiplied by 1.15 to match the simulations. The solid 
red line in Figure [8] shows ctlss if we fit all halos with 
Mvir > 5 x 1O 13 M0 as described in the previous section. 
The theoretical limit of the reduction in power by account- 
ing for massive halos along the line-of-sight is obtained by 
integrating Eqns. @ and ()12|) up to Mii m , rather than ex- 
tending the integral over all masses. The dashed line in Fig- 
ure[8]shows the corresponding result for Mn m = 5 x 10 13 Mq. 
It is clear that this estimate overestimates the reduction in 
cosmic noise, compared to the actual simulated results. 

The reason for this is simple: the theoretical limit im- 
plicitely assumes that the halo masses were determined per- 
fectly, which clearly is too optimistic. Differences in the true 
mass M t and the fitted mass Mi add additional power to the 
theoretical limit. For the Poisson term the residual power 
P p res is given by: 



-Prcs(^) 



(27T) 



df/M — f-^ x 



AM t \y(k,M t )-y(k,M t )\ 2 W(M t -M f ), 



(13) 
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Figure 8. The cosmic noise signal (LSS-only) as a function of 
radius. The black line is the total signal and the dashed black 
line is the signal from the halo model. The Poisson term in the 
halo model has been multiplied by 1.15 to match the simulations. 
The solid red line indicates the cosmic noise when halos with 
M vir > 5 x 1O 13 M are included in the fit (see text for details). 
The red dashed line shows the halo model signal if such halos are 
removed perfectly. The dashed region indicates our estimate for 
the range in scatter when the uncertainties are taken into account. 



where W(M t — Mf) describes the distribution of the differ- 
ence between the true and recovered masses. Comparison 
with the simulations shows that W can be approximated by 
a Gaussian with a dispersion a that depends on the halo 
mass, with a = 2.3 x 1O 13 M + 0.28 x M t . We need to add 
the contribution P hh to P^ cs , which in the ideal case is inte- 
grated up to Mii m . However, we cannot fit the contributions 
from halos outside the field of view and the actual halo-halo 
contribution will lie between the ideal case and the full P h . 

The shaded region in Figure [8] indicates the expected 
range in oxss when we account for the uncertainties in the 
modelling. The actual results agree very well with our es- 
timates based on the halo model. In reality the situation 
is even more dire, because we ignored shape noise in the 
calculations presented here. 



5 CONCLUSIONS 

We used the Millennium Simulation to study how large-scale 
structure along the line-of-sight (cosmic noise) affects the 
uncertainty in the weak lensing masses of clusters of galax- 
ies. After accounting for differences in the calculation of the 
non-linear power spectrum of density fluctuations, analytical 
estimates agree well with the si mulations. The simula tions 
therefore support the findings bv lHoekstral |200ll . 2003) that 
cosmic noise is a relevant source of uncertainty in weak lens- 
ing mass determinations and therefore should be included in 



the reported error budget. We do note that the adopted as 
in the simulation is higher than the currently favoured value, 
which reduces the amplitude of the cosmic noise somewhat. 

We also examined whether variations in the source 
galaxy redshift distribution are an important source of un- 
certainty. Although the importance increases with the red- 
shift of the cluster, we find it is never significant when com- 
pared to statistical errors or cosmic noise. For the simulated 
redshift distribution of sources used here we find that source 
redshift information improves the precision of the mass mea- 
surement, because the boost in lensing signal by the removal 
of foreground galaxies is larger than the increase in cosmic 
noise due to the increase in the mean source redshift. 

Finally we examined whether it is possible to reduce 
the effect of cosmic noise by identifying galaxy clusters and 
groups along the line-of-sight. Such structures can be lo- 
cated fairly easily in multi-colour data. We study a simple 
approach where we fit the masses of these additional struc- 
tures down to a mass Mu m and find that cosmic noise can 
indeed be reduced, in particular for clusters at very low red- 
shifts (z ~ 0.05). Nonetheless, the cosmic noise remains a 
dominant source of uncertainty. To better understand the 
limitations of modelling the contribution from distant large- 
scale structure, we computed the expected signals using the 
halo model. We find that the uncertainties (or variations) in 
the profiles fundamentally limit the suppression of cosmic 
noise. As a consequence, cosmic noise will remain a domi- 
nant source of uncertainty in weak lensing cluster mass mea- 
surements, and should not be ignored. 
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